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Dedicated to Professor Friedrich Gotze on the occasion of his 60th birthday 

We establish asymptotic vertex degree distribution and examine 
its relation to the clustering coefficient in two popular random in- 
tersection graph models of Godehardt and Jaworski [Electron, Notes 
Discrete Math. 10 (2001) 129-132]. For sparse graphs with a positive 
clustering coefficient, we examine statistical dependence between the 
(local) clustering coefficient and the degree. Our results are math- 
ematically rigorous. They are consistent with the empirical obser- 
vation of Foudalis et al. [In Algorithms and Models for Web Graph 
(2011) Springer] that, "clustering correlates negatively with degree." 
Moreover, they explain empirical results on scaling of the local 
clustering coefficient of a vertex of degree k reported in Ravasz and 
Barabasi [Phys. Rev. E 67 (2003) 026112]. 

1. Introduction. In a recent paper [12], Foudalis et al. analyzed Face- 
book data and made the observation that "clustering correlates negatively 
with degree." Their empirical findings suggest that the chances of two neigh- 
bors of a given vertex to be adjacent is a decreasing function of the vertex 
degree. A reasonable question to ask is whether and how such a phenomenon 
can be explained with the aid of a known theoretical model. 

This question is addressed in the present paper. We consider two sim- 
ple random graph models admitting a power-law degree distribution and 
positive clustering coefficient. 

Given a finite set W and a collection of its subsets Di, . . . , Dn, the ac- 
tive intersection graph defines adjacency relation between the subsets by 
declaring two subsets adjacent whenever they share at least s common el- 
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ements. The passive intersection graph defines adjacency relation between 
elements of W. A pair of elements is declared an edge if it is contained in s 
or more subsets. Here s > 1 is a model parameter. Both models have reason- 
able interpretations: in the active graph, two actors Di and Dj establish a 
communication link whenever they have sufficiently many common interests; 
in the passive graph, students become acquaintances if they participate in 
sufficiently many joint projects. In order to model active and passive graphs 
with desired statistical properties, we choose subsets Di,. . . ,Dn at random 
and obtain random intersection graphs. Such random graph models have 
been introduced in [13, 19]; see also [10]. 

We remark that the adjacency relations of random intersection graphs 
resemble that of some real networks, like, for example, the actor network, 
where two actors are linked by an edge whenever they have acted in the 
same film, or the collaboration network, where authors are declared adjacent 
whenever they have coauthored at least s papers. These networks exploit the 
underlying bipartite graph structure: actors are linked to films, and authors 
to papers. Newman et al. [22] pointed out that the clustering property of 
some social networks could be explained by the presence of such a bipartite 
graph structure; see also [1, 16], and references therein. In this respect, it is 
relevant to mention that the random intersection graphs of the present paper 
can be obtained from the random bipartite graph with bipartition 1/ U M^, 
where each vertex Vi of the set V = {vi, . . . , Vn} selects the set Di CW of its 
neighbors in the bipartite graph independently at random. In addition, we 
assume for simplicity that all elements of W have equal probabilities to be 
selected. Now, the active intersection graph defines the adjacency relation 
on the vertex set V: Vi and vj are adjacent if they have at least s common 
neighbors in the bipartite graph. Similarly, vertices Wi,Wj S W are adjacent 
in the passive graph whenever they have at least s common neighbors in 
the bipartite graph. An attractive property of these models that motivated 
our study is that they capture some features of real networks and, therefore, 
they may be useful in better understanding the statistical properties of some 
social networks. 

To summarize our paper, we note that both active and passive ran- 
dom intersection graphs admit a nonvanishing clustering coefficient although 
their "clustering mechanisms" are different. Both models admit a power-law 
(asymptotic) degree distribution but of a different structure. A common fea- 
ture of these models is that the (asymptotic) degree distribution of graphs 
with a nontrivial clustering coefficient (i.e., one taking values other than 
or 1) has a finite second moment. Another interesting fact is that, in 
many cases, the chance of two neighbors of a given vertex to be adjacent 
decays as ck~^, where k is the vertex degree. For example, the scaling 
is shown for a power-law active random intersection graph with a positive 
clustering coefficient and integrable degree. This theoretical result agrees 
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with the empirical findings reported in [24] showing the scaling in some 
real networks. 

The paper is organized as follows. In Section 1, we introduce the active 
random intersection graph and collect results about the degree distribution, 
clustering coefficient and statistical dependence between the degree and the 
clustering coefficient of a given vertex. Section 2 is devoted to the passive 
random intersection graph. Proofs are given in Section 3. 

In what follows, the expressions o(-), O(-) refer to the case where the 
size m of the auxiliary set W and the number n of subsets defining the 
intersection graph both tend to infinity (unless stated otherwise). Given 
a sequence {^m} of random variables, we write = op(l) if — )• in 
probability, that is, Ve > 0,P(.^.m > e) = o(l). We write = Op{l) if the 
sequence is stochastically bounded, that is, Ve > 0,3^ > 0: sup^P(|^rn.| > 



2. Active intersection graph. Given a set of attributes W = {wi, . . . , Wm}, 
an actor v is identified with the set D{v) of attributes selected by v from W. 
We assume that the actors vi, . . . ,Vn choose their attribute sets Di = D{vi), 
1 <i <n, independently at random, and we declare Vi and Vj adjacent (de- 
noted Vi ~ Vj) whenever they share at least s common attributes, that is, 
{DiODjl > s. Here and below, s > 1 is the same for all pairs Vi,Vj. The graph 
on the vertex set V = {vi, . . . , Vn} defined by this adjacency relation is called 
the active random intersection graph; see [14]. Subsets of W of size s play a 
special role; we call them joints. They serve as witnesses of established links: 
Vi ~ Vj whenever there exists a joint belonging to both Di and Dj. 

We first assume, for simplicity, that the random sets Di,. . . , Dn have the 
same probability distribution of the form 



That is, given an integer k, all subsets ^ C of size |^| = A; receive equal 
chances, proportional to the weight P{k), where P is a probability on {0, 1, 
. . . , m}. The random intersection graph defined in this way is denoted Gs{n, 
m,P). One special case where all random sets are of the same (nonran- 
dom) size has attracted particular attention in the literature (see, e.g., [3, 
10, 14, 23, 25, 30]), as it provides a convenient model of a secure wireless 
network. We call it the uniform active random intersection graph and de- 
note Gs{n,m,6x)- Here 6x is the probability distribution putting mass 1 on 
X, and X is the size of the random sets. 

2.1. Degree distribution. We consider a sequence of random intersection 
graphs, where m and n = Um tend to infinity, and where s = s^n. and P = Pm 
depend on m. We suppress the subscript m in what follows whenever this 



A)<e. 



(2.1) 




for AcW. 
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does not cause an ambiguity. By = | | we denote the size of the attribute 
set Di. Note that Xi is a random variable taking values in {0, 1, . . . , m} and 
having the probability distribution P. 

Lemma 1. Let m — )• +oo. Assume that s = 0(1) and 'EXlIj^Xi>e,/m} — 
o(EXf) for any e > 0. Then for any pair of vertices Vi, vj , the edge probability 
in Gs{n, m, P) 

(2.2) P(z;,~t;,) = (l+o(l))(^) '(e(^i))'. 

In particular, we have P(fj ~ vj) = o(l). 

We note that E("'^*) is the expected number of joints available to the 
vertex Vi. (2.2) may fail in the case where s = Sm ^ oo as m— )-oo; see 
Example 2 below. 

Remark 1. In the particular case where s remains fixed as n,m — oo, 
Lemma 1 suggests that a sparse graph Gs{n,m, P) is obtained when the 
random sets are of order 

(2.3) |A|=Op(mi/'n-'/('^)). 

By sparse we mean that the degree of the typical vertex is stochastically 
bounded as n — t- oo . 

Our next result shows that the degree of the typical vertex of a sparse ran- 
dom intersection graph is asymptotically Poissonian with (random) intensity 
parameter ZiEZi. Here Zi = Zmi = (™)~^''^?^"^^^("^') denotes the properly 
rescaled number of joints of the vertex Vi, 1 <i <n. Let d{vi) = dm{vi) de- 
note the degree of the vertex Vi in Gs{n,m, P). Observe that, by symmetry, 
the random variables d{vi), . . . , d{vn) have the same probability distribution. 

Theorem 2.1. Let m,n^ oo. Assume that s = 0{1) and: 

(i) Zmi converges in distribution to a random variable Z; 

(ii) EZ < oo and limm_j.oo EZ^i = EZ. 
Denote = EZ. We have, for k = 0,1,2 ... , 

(2.4) hm Pidmivi) = k) = (fc!)-iE((Z^i)'=e-^''). 

m— >oo 

Formula (2.4) remains valid if the condition s = 0{l) is replaced by the 
following weaker condition: there exists < a < 1 such that s < am and 
(sl/n)^/^ = o(m). In this case, we require, in addition to (i) and (ii), that: 

(iii) Ek=2iZmiZ^kY^+'^/' = op(l). 
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Formula (2.4) relates the vertex degree distribution to the distribution of 
sizes of the random sets. In particular, the more variable is the sequence 
of realized values Xi, . . . ,Xn, the more irregular is the sequence of vertex 
degrees of a realized instance of the intersection graph. For example, we 
obtain a power-law distribution pf^ ~ c/c~'^ as /c — )• oo whenever the limiting 
distribution of {Zmi}m has a power law P(Z > t) ~ c't^~'^ as t — )• oo. Here 
7 > 2, and c, c' denote some positive constants. Observe, that the asymptotic 
degree distribution defined by formula (2.4) has a first moment. 

In the case where s = 1, the asymptotic degree distribution of an active 
random intersection graph was shown (in increasing generality) in [4, 9, 17, 
27]; see also [6] and [26]. In particular, the result of Theorem 2.1, for s = 1, 
can be found in [4, 6] and [26]. For s > 1, the result of Theorem 2.1 is new. 
It also applies to the case where s = +oo as m — )■ oo . We note that 

limit (2.4) may fail when s = Sm grows to infinity "sufficiently fast;" see 
Example 2 below. A general sufficient condition for the convergence to the 
Poisson mixture (2.4) is given in Theorem 2.2. 

In Theorem 2.2, we assume that the random sets Di, . . . ,Dn defining 
the intersection graph are independent but not necessarily identically dis- 
tributed. We write, as before, Xk = \Dk\ and assume that for every k = 
l,...,n and each AcW,we have P(L>fc = A) = {J^^)~'^P{Xk = \A\). Let P 

denote the distribution of the random vector X = {Xi, . . . , Xn)- By Gs{n,m, 
P) we denote the random intersection graph on the vertex set V, where 
Vi,Vj G V are adjacent whenever {DiCi Dj \ > s. By d{v) = dm{v) we denote 
the degree of G y in Gs{n, m, P). For a vector with nonnegative integer co- 
ordinates X = {xi, . . . , Xn), we write Uk = (^/ )(^j'" ) and = max{0, Xk — s}, 
and we denote 



-in . s _2 n 

k=2 ^ ' k=2 



^(^)=f^) ^Uk, Kl(x) 



K2(^) = ^ ( ) y^Ukx\. 

Our next result applies to a sequence of random intersection graphs Gs{n,m, 
P), where m — )• oo and n = rim oo, and where s = Sm. and the distribution 
P of the random vector X depend on m. 

Theorem 2.2. Assume that, as m^oo: 

(iv) X{X) converges in distribution to a random variable A; 

(v) Ki(X) = op{l); 

(Vi) K2iX) = Op{l). 

Then we have for A; = 0, 1, 2, ... , 
(2.5) lim P(dm{vi) = k) = {k\)-^E{A''e-^). 

m— >oo 
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We briefly remark that condition (v) is a kind of "asymptotic negligi- 
bility condition" imposed on the sequence of random variables {^g){^J'), 
2 < k < n. Condition (vi) ensures that (the distributional limit of) X{X) 
alone determines the asymptotic degree distribution; see also Lemma 6 be- 
low. 

In Examples 1 and 2 below, we consider uniform random intersection 
graphs. In Example 1, we formulate, in terms of n = rim, x = Xm, and s = Sm, 
conditions that are sufficient for the convergence (2.5). Example 2 shows that 
conditions (iv) and (v) alone do not suffice to establish the convergence (2.5). 
Here we are in a situation where the edge probability formula (2.2) fails. 

Example 1. Let {xm} and {sm} be integer sequences such that 1 < 
Xm - Sm = o{{m - XmY^"^) and (^™) (s™ )^ = o(l) as m oo. Let Um be 

an integer sequence such that the limit limm nm(^™ )^( )~^ exists and 
is finite. We denote this limit by A. The sequence of random intersec- 
tion graphs {G s„X''^m-,fn, 5x^)} satisfies the conditions of Theorem 2.2 with 
A = A. Therefore, the degree of the typical vertex has limiting Poisson dis- 
tribution with mean A. 

Example 2. Given m, let s = 0.5m, x = {e + 0.5)m. Here e G (0, 1) 
is a small absolute constant. We choose e small enough so that p*{m) := 
(s) ~ as m — )• oo. Next, we choose an integer sequence Um t +oo 

such that nmP* {m) — )• 1 as m — )• oo and consider the sequence of random in- 
tersection graphs {Gs{nm,'rn,5x)}m- It follows from the relation nmP*{fn) — )• 
1 that conditions (iv)-(v) hold with A = 1. Moreover, we show that dm{vi) = 
op(l) as m —7- oo. The latter relation contradicts to (2.5). Note that for this 
sequence of random intersection graphs, condition (vi) fails. Indeed, we have 

K2(^) = (f5^"-mP*("i) — )■ +00 as m — )■ oo. The proofs of the statements of 
Example 2 are given in Section 3. 

2.2. Clustering coefficient. Typically, the adjacency relations between 
actors in real networks are not statistically independent events. Often, chances 
of a link v' ~ v" increase as we learn that actors v' and v" have a common 
neighbor, say, u. As a theoretical measure of such a statistical dependence, 
one can use the conditional probability (see, e.g., [9]) 



In the literature (see [2, 20, 21, 29]), the empirical estimates of the condi- 
tional probability a. 



a = P(f' ~ v"\v' ~ V, v" ~ v). 




and a 
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are called the clustering coefficient and the global clustering coefficient, re- 
spectively. Here n denotes the number of vertices of a graph, N3{v) is the 
number of unlabeled triangles having vertex v, N2{v) is the number of un- 
labeled 2-stars with the central vertex v. In this paper, the term clustering 
coefficient is used exclusively for the conditional probability a. 

Note that, in the random graph Gs{n,m,P), the conditional probability 
a does not depend on the choice of v,v',v" . It does not depend on n either. 
We write a = as{m, P) in order to indicate the dependence on s, m and P. 

We begin our analysis with the uniform random intersection graph Gs{n, 
m,6x) where, for large m, the asymptotics of the clustering coefficient is 
simple and transparent. Since we are interested in sparse random graphs, 
we may assume that P(fj ~ Vj) = o(l) as m — )■ oo. Observe that in the 
case of bounded s [i.e., s = 0(1) as m — )• oo], this assumption implies that 

= o(m); see (2.3). In the following lemma, we consider a sequence of 
uniform random intersection graphs, where x = Xm and s = Sm depend on 
m, and {x — s)^ = o(m — x). Note that, for bounded s, the later condition is 
equivalent to = o(m). 

Lemma 2. Let m — > +oo. Assume that s < x and (x — s)^ = o(m — x). 
Then 

(2.6) a,(m,5,) = (^^^ +o(l). 

The assumption s < x of the lemma excludes the trivial case x = s, where 
we have as{m, 6x) = 1- Lemma 2 brings some insight into the general model 
Gs{n,m,P). Namely, in the case of a sparse random graph, it suggests that 
the clustering coefficient is nonvanishing whenever the sizes of random sets 
are stochastically bounded as m — t- oo. In fact, we need to impose an even 
stronger condition, which requires, in particular, that the moment of order 2s 
of P = Pm is bounded as m — >• oo. For k = l,2, we denote = / {^)^P{dx) = 

E(^^)^ 

Lemma 3. Assume that for k = 1,2, we have, as oo, 

(2.7) Ofc > and Ve G (0, 1) — / (^^ P{dx) = o{l). 

a-k Jx>e^ V ^ / 

Then 

(2.8) as{m,P) = ai/a2 + o{l). 

Note that invoking in (2.8) the simple inequality af < a2, we obtain 

— 1/2 

as{m,P)<a2 + o(l). Hence, 02 —)• oo implies as(m, P) —)• 0. 

Up to our best knowledge, the first result showing that a power-law (ac- 
tive) random intersection graph with m ~ f3n, f3 > admits a nonvanishing 
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clustering coefficient is due to Deijfen and Kets [9]. They established a first- 
order asymptotics of ai{m,P) as m — t- oo in the particular case where P 
is a mixture of binomial distributions. Yagan and Makowski [30] evaluated 
the clustering coefficient ai{m,5x) of a uniform (active) random intersec- 
tion graph and proved that it is always positive. Nonvanishing clustering 
coefficients of a random intersection digraph were studied in [5] . The effect 
of a positive clustering coefficient on the size of the largest component of 
Gi{n,m,P) was studied in [6]. The effect on an epidemic spread was con- 
sidered in [8]. 

2.3. Clustering coefficient and degree. Here we consider a sequence of 
sparse random intersection graphs {Gs{n,m,P)}m with nonvanishing clus- 
tering coefficient and nondegenerate asymptotic vertex degree distribution. 

In our next theorem and its corollary, we express an approximate formula 
for the clustering coefficient (2.8) in terms of moments of the asymptotic 

vertex degree distribution. Recall the notation Zi = {'^J^){"^)~^^'^n^^'^ ■ Here 
Xi is a random variable with the distribution P. 

Theorem 2.3. Let /3 > 0. Let m — > co. Assume that s = 0(1) and 

(2.9) (T)'^"^^- 

Suppose that conditions (i) and (ii) of Theorem 2.1 hold. In the case where: 
(ii') < EZ2 < oo and lim.;„^oo EZf = EZ^ 
we have 

1 EZ 

(2.10) a,{m,P) = -^^ + o{l). 

In the case where EZ^ = oo, we have as{m,P) =o(l). 

Remark 2. Observe that under conditions (i) and (ii'), the asymptotic 
degree distribution exists and is defined by (2.4). Indeed, (i) and (ii') im- 
ply (ii), and, therefore, one can apply Theorem 2.1. Furthermore, EZ^ < oo 
implies that the asymptotic degree distribution has a second moment. As- 
suming, in addition, that (2.9) holds, we conclude from Theorem 2.3 that 
the clustering coefficient does not vanish in the case where the (asymptotic) 
degree distribution has finite second moment. Moreover, if (2.9) fails and 
we have n = o{{"^)), then the clustering coefficient vanishes [i.e., a = o(l) as 
n,m^ oo] in the case where (i) and (ii') hold, as well as in the case where 
(i), (ii) hold and EZ^ = oo. The proof of this statement is given in Section 3. 

Next we express (2.10) in terms of moments of the asymptotic degree 
distribution. Let be a random variable with the asymptotic degree dis- 
tribution defined by (2.4), that is, we have 

(2.11) P{d. = k)=pk, pfc = (fe!)-^E((Z/i) V^/^), A: = 0,1, 2,.... 



DEGREE AND CLUSTERING 



9 



Then Ed^ = (EZf and Edl = (EZfEZ^ + (EZf. Invoking these formulas 
in (2.10), we obtain the following: 

Corollary 1. Let m,n ^ oo. Assume that s = 0(1) and conditions 
(i), (ii') and (2.9) hold. Then we have 

Foudalis et al. [12] made an interesting observation, based on an empiri- 
cal study of a power-law network data, that "clustering correlates negatively 
with degree." More precisely, their numerical data suggest that the condi- 
tional probability 

(2.13) a^^"^ =Y'{v' ^ v"\v v\v v" ,d{v) = k) 

is a decreasing convex function of (the realized value k of) the degree d{v) of 
vertex v. Theorem 2.4 establishes a first-order asymptotics of the conditional 
probability a^^'^ = a^^^ (m, P) as m, n — )• oo for Gs{n, m, P) model. The result 
of Theorem 2.4 provides a rigorous argument explaining empirical findings 
of [12] mentioned above; see also Example 3 below. 

Theorem 2.4. Let /3 > 0. Let m — )• oo. Assume that s = 0{1) and that 
conditions (i), (ii') and (2.9) hold. Then we have, for every A; = 2, 3,..., 

(2.14) aW(rn,P) = iM^^ + ,(l). 

k V/5 Pk 

Note that, for a power-law asymptotic degree distribution pk ~ ck~^ , 

(2.14) implies 

(2.15) aW(m,P) ~c'fc"^ as A; ^ oo 

for any 7 > 3. Here c' = /3~^/^EZ. Hence, q['1 "correlates negatively with 
degree." 

Next we illustrate (2.14) by two examples, where we assume, for simplicity, 
that s = 1 . 



Example 3. Fix 7 > 3 and ^ > 0, and consider a sequence of random 
intersection graphs satisfying the conditions of Theorem 2.3 with Z having 
the power-law distribution P(Z > t) = (9ty~'^ for t>6~^. In this case, we 
have Pk ~ (7 - l)9^~'^k-^ as A: 00. Now (2.14) implies (2.15). 

Example 4. Fix n> and consider a sequence {Gi (n, m,5x)} such that 
x(?7-/m)^/^ H as m — 7- 00. The sequence satisfies the conditions of Theo- 
rem 2.1 with Z having the degenerate distribution F{Z = fi) = 1. Hence, by 
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Theorem 2.1, the asymptotic degree distribution is Poisson with mean /i^. 
In this case, we have Pk-i/Pk = kfJ.'"^, and (2.14) imphes a^'^] w fi~^. Hence, 
does not correlate with degree. 

The rationale behind these examples is as follows. In a sparse active graph, 
an edge is typically realized by a single joint shared by the two vertices linked 
by this edge. Similarly, a triangle is realized by a single joint shared by the 
three vertices of the triangle. Therefore, two neighbors, say, Vi and Vj of a 
given vertex vt establish a link whenever the joints responsible for the edges 
Vi ~ Vt and Vj ~ vt match. In particular, given Xt = \Dt\, the probability 

that two neighbors of vt are adjacent is approximately (^*) . Here ("^*) 
is the number of joints available to vt. Next, we remark that given Xt, the 
number of neighbors of vt has binomial distribution Bin(n — l,P(t)) with 

P{t) ~ ); see Lemma 6 below. For large values of Xt, this 

number concentrates around its mean because of the concentration property 
of Binomial distribution. Hence, for large values of Xt, the degree d{vt) scales 
as (n — l)p(j) = {'^s^)t, where r /3~^/^EZ as n,m^ +oo. In particular, 
given a vertex vt with a large degree, say, d{vt) = fc, it is reasonable to expect 

that (n — ^)p{t) ~ k, that is, {J) ~ k~ t. We summarize the argument 
as follows: The probability that two neighbors of a vertex of degree k are 

adjacent scales as ) ~ k~^T, for k — )• +oo. This explains formula (2.14) 
in the case where the sequence Xi , X2 , ■ ■ ■ , Xn exhibits a high variability, for 
example, it is a sample from a heavy-tailed distribution. The argument fails 
in the case of uniform random intersection graphs, since here all attribute 
sets are of the same size [Xi = const) and, hence, the realized values of d{vt) 
and Xt do not correlate. 

Remark 3. Using the identity 'Ed^ = (EZ)^, we can express (2.14) 
solely in terms of the (asymptotic) degree distribution 



In the case where m is large and the distribution (2.11) is close to that 
of the degree sequence of the observed graph, we can replace the moments 
Ed*,Ed2 and probabilities P(4 = k), P((i* = k - I) in (2.12) and (2.16) 
by their estimates based on the observed degree sequence. In this way, we 
obtain estimates of a and a^^'^ based on the degree sequence and involving 
the parameter (3. 

Finally, we note that the second moment EZ^ < 00 required by Theo- 
rem 2.4 does not show up in (2.14). This observation suggests indirectly 
that the second moment condition could perhaps be replaced by the weaker 
first moment condition. 
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2.4. Concluding remarks. We remark that the asymptotic degree distri- 
bution oi Gs{n,m,P) as n, m — ?• oo is determined by the hmiting distribution 
of the properly scaled number of joints of a typical vertex, that is, by the 
limiting distribution of Zi = {"^)~^n^^'^{^g^), which we denote Pz- Further- 
more, the first-order asymptotics as n, m — t- +oo of the clustering coefficients 
a and a^^^ is determined by Pz and (3 = lim(™)7i~^. One may observe that 
the parameter s does not show up in an explicit way neither in the descrip- 
tion of the asymptotic degree distribution nor in the asymptotic formulas 
for a and at*^!. We expect that the role of the parameter s becomes more im- 
portant in the case of denser graphs, for example, in the case where (™)n~^ 
converges to zero sufficiently fast. 

3. Passive intersection graph. In this section, we consider graphs on the 
vertex set W = {wi, . . . , Wm}- Let s > 1 be an integer. Let Di, . . . , Dn be 
independent random subsets of W having the same probability distribu- 
tion (2.1). We say that vertices w,w' £W are linked by Dj if Wjw' £ Dj. 
For example, every w' € Dj \ {w} is linked to w by Dj. The links created by 
Di,. . . ,Dn define a multigraph on the vertex set W. In the passive random 
intersection graph, two vertices w,w' €W are declared adjacent whenever 
there are at least s links between w and w'; that is, the pair {w,w'} is con- 
tained in at least s subsets of the collection {Di, . . . , Dn}; see [14]. We denote 
the passive random intersection graph G*{n,m,P). Here P is the common 
probability distribution of the random variables Xi = \Di\,. . . ,X„ = \Dn\. 
We shall consider only the case where s = l. 

Before presenting our results, we introduce some notation. With a se- 
quence of probabilities Q = {qo.,qi, ■ ■ .} such that qj = 1 and fj.Q = jqj < 

oo, we associate another sequence of probabilities Q = {qo,qi, ■ . .} obtained 
as follows. In the case where hq > 0, we define qj = {j + l)gj+i//iQ, j = 
0, 1, 2, . . . . For /ig = 0, we put % = I and (jj = 0, j = 1, 2, In particular, 

we denote by P = Pxi the probability distribution on {0,1,2,...} putting 
mass (j + l)P(Xi = J + 1) /EXi on an integer j = 0, 1, 2, . . . . By P^ we denote 
the probability distribution of a random variable ^. 

3.1. Degree distribution. Let d = denote the degree of i^i in GJ(n, 

m,P), and let L = L{wi) denote the number of links incident to wi. Our 
next theorem establishes the asymptotic degree distribution of a sequence 
of sparse passive random intersection graphs in the case where m and n 
are of the same order. By "sparse" we mean that the degree d remains 
stochastically bounded, that is, d = Op(l) as m,n^ oo. 

Theorem 3.1. Let f3 > 0. Let m, n — )■ oo. Assume that mn~^ — )• /3 and: 

(vii) Xi converges in distribution to a random variable Z . 

(viii) EZ < oo and limm-5.00 EXi = EZ. 
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Then L converges in distribution to the compound Poisson random vari- 
able = Ylf=i^j- Here Zi,Z2,-- - are independent random variables with 
common probability distribution Pz, the random variable A is independent 
of the sequence Zi,Z2,.-- and has Poisson distribution with mean EA = 
p-'^BZ. 

If, in addition, 
(ix) ^ ^ lim„,_,oo EXf^^ = EZ^/^^ 

then d converges in distribution to d*. 

Theorem 3.1 shows how the (asymptotic) vertex degree distribution de- 
pends on the distribution of sizes of the random sets. In particular, we obtain 
a power-law degree distribution whenever the distribution of the sizes has 
a power-law. Moreover, since the distribution of Zi has a much heavier tail 
than that of Z, we can even obtain (asymptotic) degree distribution with 
infinite first moment. We refer to [11] for a survey of results on local and tail 
probabilities of sums of heavy-tailed random variables, including, in partic- 
ular, random sums and compound Poisson random variables. 

Remark 4. Note that (vii) and (ix) imply (viii). Hence, condition (ix) 
is more restrictive than (viii). The fact that the asymptotic distribution of d 
only refers to the first moment of Z suggests indirectly that the 4/3 moment 
condition (ix) could perhaps be waived. 

In the next lemma, we collect several facts about the degree distribution 
in the case where either n = o{m) or m = o{n). 

Lemma 4. Let m,n — t- oo. In the case where n = o{m), we have 

(3.1) Ve>0 Pidyemn-^ -l)>P{d>l)-e. 

Now assume that m = o{n). Denote n* = nP(Xi > 2). We distinguish 
three cases: (a) n* = o{m); (b) m = o{n^); (c) mn~^ = /3(1 + o(l)) for some 
/3 > 0. In the case (a), we have 

(3.2) Ve>0 P(d> em(max{l,n4)"^ - 1) > P(d> 1) -e. 
In the case (b), we have 

(3.3) VC>0 P(d>C) = l-o(l). 

In the case (c), the conclusion of Theorem 3.1 holds if the conditions of the 
theorem are satisfied with the random variable Xi replaced by the random 
variable Xi^ having the distribution P(Xi* =i) = P(Xi = i\Xi > 2), i>2. 

Remark 5. Observe that any of inequalities (3.1), (3.2) and (3.3) rules 
out the option of a nondegenerate and stochastically bounded d as m,n— )• 
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oo. Therefore, a nontrivial asymptotic degree distribution is possible only 
in the cases where ci < m~^n < C2 or ci < m~^nP{Xi > 2) < C2 as n, m — t- 
oo for some ci,C2 > 0. Furthermore, in the second case, the number N = 
J27=i^{Xi>2} of random sets that define the edges of Gl{n,m, P) concen- 
trates around the expected value n* = EA^ = nP{Xi > 2), and, for this rea- 
son, the asymptotic vertex degree distribution of (n, m, P) is the same as 
that of Gl{[n^\,m, P^:). Here [n^,J denotes the largest integer not exceeding 
n* , and P* denotes the distribution of Xu ■ Let us note that the asymptotic 
degree distribution of G*( [n*J , m, P*) can be obtained from Theorem 3.1. 

The asymptotic degree distribution of the passive random intersection 
graph Gi (n, m, P) has been studied by Jaworski and Stark [18] . They showed 
a necessary and sufficient condition for the convergence of the degree dis- 
tribution and determined conditions for the convergence to a Poisson limit. 
They also asked what other possible limiting distributions are. Theorem 3.1 
and Lemma 4 answer this question in the particular case of sparse graphs. 
Let us mention that the approach used in the proof of Theorem 3.1 is dif- 
ferent from that of [18]. 

3.2. Clustering coefficient and degree. The clustering coefficient 

a* = a* {n,m, P) = P{w2 ~ t^sltfi ^ W2,wi ~ it's) 

of a passive random intersection graph Gi {n, m, P) has been studied in the 
recent paper by Godehardt et al. [15]. They showed, in particular, that 

,,,, *r /3>~HE(Xi)2)^ + E(Xi)3 , „ _i 

(3.4) «(n,m,P)= ^^(e(x^)^)2 + e(X,)3 ^* ^= ""^ ' 

provided that E(Xi)2 > and E(Xi)2 = o{rn?n~^) as m,n^ oo; see The- 
orem 2 and Corollary 1 in [15]. Here and below we denote {x)k = x{x — 
l)---{x-k + l). 

We are interested in the relation between the clustering coefficient and the 
degree. Our first result expresses (3.4) in terms of moments of the (asymp- 
totic) degree distribution. 

Theorem 3.2. Let /3 > 0. Let m, n — )• oo so that mn~^ — )• /?. Suppose 
that condition (vii) of Theorem 3.1 holds and that, in addition, < EZ^ < oo 
and limm-s.oo EX^ = EZ^. Then the degree d converges in distribution to the 
compound Poisson random variable defined in Theorem 3.1. 



In the case where P{Z > 2) > 0, EZ^ < oo and limm^^oo ^Xf = EZ^, we 



73 

have 

/Q c:\ */ m E(4)2 - (E(i*)2 
(3.5) a (n, m, P) = + o(l) 

In the case EZ^ = oo, we have a* (n, m, P) = 1 + o(l) . 
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Observe that EZ^ < oo if and only if Ed^ < oo. Hence, in order to obtain 
the clustering coefficient a* <1 (as n,m^ oo), we need to require Ed^ < oo. 

Remark 6. The result of Theorem 3.2 extends to the case where m = 
o(n) and mn~^ — t- /3. Indeed, in this case, one can apply Theorem 3.2 to the 
random graph G|([n*J , m, i-**); see Remark 5. 

Next, we examine the conditional probability 

= a*^^\n, m, P) = P{w2 ~ t/^s | ~ it;2 , li'i ~ W3,d{wi) = k). 

Theorem 3.3. Let /3 > 0. Let m,n — )• oo so that mn~^ — t- /?. Assume 
that condition (vii) of Theorem 3.1 holds. Assume, in addition, that the 
random variable Z defined by (vii) has the third moment, P(Z > 2) > 
and limm-^-oo EXf = EZ^. Denote ^2* = Ylf=ii^j)2! where A, Zi, Z2, ■ ■ ■ are 
defined in Theorem 3.1. Then for every k = 2,3, . . . satisfying P((i* = k) > 0, 
we have 

(3.6) a*W(n,m,P) = ^-J—^E{d2,\d* = k) + o(l). 

Example 5. Given /3 > and an integer x >2, consider the random 
graph G*{n,m,6x), where m= [f3n\ as m,n— )-oo. By Theorem 3.1, the 
degree d converges in distribution to the random variable = {x — 1)A. 
Furthermore, we have d2* = (x — 1)2A. Here the random variable A has 
Poisson distribution with mean /3~^(x — 1). It follows from (3.6) that for 
every k = t{x — 1), t = 1, 2, . . . , we have 

X — 2 

(3.7) a*^''Un,m,6x) = l-o(l) asm,n— s-oo. 

k — 1 

Hence, in this case, a*[^l is of order k~^ as — )• 00. Therefore, a*''! "correlates 
negatively with degree." 

In order to explain (3.7), we first note that a triangle in a sparse pas- 
sive random intersection graph is typically realized by a single set Dj that 
covers all three vertices. Furthermore, with a high probability, any two sets 
Di,Dj that cover a given vertex wi have no other element in common, that 
is, Di n Dj = wi. Therefore, almost all triangles incident to wi are realized 
by the sets Di^ , Di^ that cover wi , and the number of such triangles 
is approximately the sum of numbers of pairs {w' ,w"} covered by Dij, 

l<j<t. Hence, ^ ('^'ij"^) H h {^^'*}"^). In addition, the degree 

d{wi) w \Di^ I — 1 + • • • + \Di^ I — 1- Hence, given Di-^ , . . . , Di^ , the probabil- 
ity that a pair of neighbors of wi makes a triangle with wi approximately 
equals iVA/C^^^^)- In the case where the sizes of random sets do not devi- 
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ate much from their average value EXi, we write \Dij \ ~ 'EXi and obtain 

- (E^i - - 1). This explains and generalizes (3.7). 

In the case where sizes of random sets are heavy-tailed random variables, 
we expect a different pattern for large values of d^wi). Now the maximal 
size maxi<j<( is of the same order as the sum ^i<j<t\-^ij\- ^^^^ 
reason, the fraction N/^/C^^^^^) stays bounded away from zero. Moreover, 
one may expect that, in this case, a*'*^] — t- 1 as /c — t- 00. 

4. Proofs. We begin with general lemmas that are used in the proofs 
below. Then we prove results for the active graph. Afterwards, we prove 
results for the passive graph. We note that the notation introduced in the 
proof of a particular lemma or theorem is only valid for that proof. 

4.1. General lemmas. The following inequality is referred to as LeCam's 
lemma; see, for example, [28]. 

Lemma 5. Let S = Ii + I2 + ■ ■ ■ + In be the sum of independent random 
indicators with probabilities P(Ij = 1) =pi. Let A be Poisson random variable 

with meanpi-\ Vpn- The total variation distance between the distributions 

Ps and Pa of S and A satisfies the inequality 

(4.1) dTY{Ps,PA)<2j2pl 

i 

Lemma 6. Given integers 1 < s < di < d2 < rn, let Di,D2 be indepen- 
dent random subsets of the set W = {1, . . . ,m} such that Di (resp., D2) is 
uniformly distributed in the class of subsets ofW of size di (resp., d2). The 
probabilities p := P(|-Di r\D2\ = s) and p := P(|-Di H D2\ > s) satisfy 

{ di - s){d2 
m+1 — di 



where we denote pX^,^^^ = {^l){''^^)r,r^ . 

Proof. It suffices to establish inequalities (4.2) for conditional proba- 
bilities given D2. In order to prove the left inequality, we write p = 

i1)&izr'=ypk,d.,s^^^^'^ 

^_'^'tt'Yi d2-s \ d2-s {d2-s){di-s) ^ 

V m — s — i 1 ^ m — s — i m + 1 — di 

i=0 i=0 

Let us show the right inequality of (4.2). Since, for every D C D2 of size 
\D\ = s, the number of subsets of W of size di that contain D is at most 
(^Is), we conclude that p< C^f )(^^'l^)(^)"^ Note that the quantity in 
the right-hand side of the latter inequality equals p*^_^ ^ . □ 
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Lemma 7. Let {Xni,Xn2, ■ ■ ■ ,Xrm}n>i be a collection of nonnegative 
random variables such that, for each n, the random variables Xni, ■ ■ ■ , X^n 
are independent and identically distributed. Let a > 1. Let Sa = n~'^{X^^ + 

\-X^^). Assume that EX„i < oo for each n and that the sequence {Xni}n 

converges in distribution to a random variable Z. Assume, in addition, that 
EZ < oo and lim„ EXni = EZ. Then 



(4.3) sup'EXnil{x„i>x} -^0 as +oo, 

n 

(4.4) Si-EZ = op{l) asn^+oo, 

(4.5) Va > 1 Sa = op{l) asn^+co. 



Proof. Relations (4.3) and (4.4) are shown in the proof of Remark 1 
in [4] and Corohary 1 in [5]. Let us prove (4.5). We shall show that P{Sa > 
e) < eEZ + o(l) as n — )• oo for each e £ (0, 1). Given e, introduce the event 
B = {maxi<j<.„ Xni < e'^^^"~^^n} and note that Sa < e'^Si when the event B 
holds. Hence, we have 

P{Sa >e)< P{{Sa >e}nB) + P{B) < P(5i > e'^) + P{B). 

Here the complement event B has the probability 

P(B) < nP(X„i > e2/("-i)n) < e~^/^^~'^EXr,.,I^^^^^,2/i.-^)„^ = o(l). 

In the last step, we applied (4.3). Finally, the bound P(5i > e^^) < eESi = 
e(EZ + o(l)) completes the proof. □ 

4.2. Active graph. We first note that Lemma 1 is an immediate con- 
sequence of Lemma 6. Next, we prove results on the degree distribution: 
Theorems 2.2 and 2.1 and Example 2. Afterwards, we prove the statements 
related to clustering coefficient. 

Proof of Theorem 2.2. Let doo be a random variable with the dis- 
tribution defined by the right-hand side of (2.5). We write, for short, dm = 
dm{vi). Let fm{t) = Ee**'^"' and f{t) = Ee^^^°° denote the Fourier transforms 
of the probability distributions of d^ and doo ■ In order to prove the theorem, 
we show that lim-m fm{t) = f{t) for every real t. 

Given < 5 < 0.01 and an integer m, introduce the event A = {ni{X) < 
6,i = 1,2} . Note that, by (v) and (vi), we have P(^) = 1 — o(l) as m — )■ oo. 
Therefore, we write 

(4.6) /„(t)=E(e^*'^'"I^) + o(l). 

On the event A, we approximate the conditional characteristic function 

fm{t;x):=E{e'"'-\X = x) 
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by the Fourier transform of the Poisson distribution with mean X{x), 

gm{t;x) = exp{A(x)(e** - 1)}. 

Since the conditional distribution of dm, given the event {X = x}, is that of 
the sum of independent BernouUi random variables with success probabilities 

qk = Pi\DinDk\ >s\Xi=xi,Xk = Xk), 2<k<n, 

we write 

fmit;x)= H (l + g,(e**-l)) = exp| ln(l + gfc(e**-l)) 

2<k<n ^2<k<n 

Note that (4.2) implies 

It follows from the right-hand side inequality of (4.7) and the inequality 
(™)~ u\<5< 0.01, which holds on the event A, that for each k, we have 
^^(e** — 1)1 < 0.5. Invoking the inequality | ln(l + z) — z\< \z\^ for complex 
numbers z satisfying \z\ < 0.5 (see, e.g.. Proposition 8.46 of [7]), we obtain 
from (4.7) that 

fm{t;x) = exp{A(x)(e** - 1) + r{t)}, 

where \r{t)\ < 4ki(x) + 2k2{x). Now, the inequalities Ki{x) < 5, i = 1,2, 
which hold on the event A, imply that |/m(t;x) — gm{t;x)\ < 75. Invoking 
this inequality in (4.6), we obtain 

|/„^(t)-Eexp{A(X)(e**-l)}| <75 + o(l) asn^oo. 

Finally, the convergence in distribution of {A(X)} [i.e., condition (iv)] im- 
plies the convergence of the corresponding expectations of bounded contin- 
uous functions. Therefore, lim^ Ee'*'^'''")*^^' = f{t). We obtain the inequal- 
ity limsup^ \ fm{t) — f{t)\ < 75, which holds for arbitrarily small > 0. The 
proof of Theorem 2.2 is complete. □ 

Proof of Theorem 2.1. Throughout the proof, we assume that the 
integ 6rs Ti — 'i^m-j ^ — ^m,5 S'lid. th.6 distributioiis of random variables (^i, • • • ; 
Xn) = X and Zi = {^;)n^/^{"^)~^^^, 1 < i < n, ah depend on m. 

We derive Theorem 2.1 from Theorem 2.2. To this aim, we verify condi- 
tions (v) and (vi) and show that condition (iv) holds with A = fiZ. The fact 
that (i) and (ii) imply (iv) and (v) follows from Lemma 7. Here we show that 
(i), (ii) and (iii) imply (vi) in the case where s < am and (sl/n)^/^ = o{rn). 
In addition, we show that (i) and (ii) imply (iii) in the case where s = 0(1). 
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Note that s = 0(1) means that the sequence {sm} is bounded, and this is a 
much more restrictive condition than s < am and (s!/n)^/'' = o{m). 

We start with the observation that the inequahty 1 — < \/l — m^^, 
which holds for 1 <y <m, imphes the inequahty 

(4.8) J1L = J^)£ 1 > ( ^-^ + L yj_ VxG[s,m]. 

^JCT) Vi^sVs^- V Vm -s + lj Vs! 

In order to show (vi), we prove that 

(4.9) VeG(0,l) hmsupP(K2(X) >e) <e. 

Here and below, the limits are taken as m — )• oo. Given e, we find a (suffi- 
ciently large) constant A> such that P{Zi > ^) < A~^YjZ\ < e uniformly 
in m. Here we applied Markov's inequality and used (ii). In view of the 
inequality P(Zi > A) < e, we can write 

(4.10) P{K2{X)>e)<e+p„ 

where Pe ■= P{{k2{X) > e} n {Zi < A}). Observing that on the event {Zi < 
A} we have, by (4.8), 

, / , \l/s 

^1 - s + 1 < Vm-s + l( ^ Vs!/ 



we obtain from the bound (sl/n)^/^ = o(m) the inequality Xi < am + o{m). 
It follows now that 

(Xi-. + l)(,Y.-.+ l) _ + + 

m-Xi + l ' ^ " m 

Furthermore, invoking the inequality 



(4.11) (X.-, + l)(X.-. + l) /^,(;V)(_ 

m V f 



1/. 



which follows from (4.8), we obtain the inequality 

Yi < ((1 - a)-i + o{imZ^Zi)s\n-^f\ 
Hence, on the event {Zi < A}, we have K2{X) < ((1 — a)^^ + o(l))x. Here 

we denote x= ^^s+i)/3 Ylk=2^\^'^ ^^'^ = (-^i-^fc)^^*- Now we see that (iii) 
implies Pe = o(l), and, therefore, (4.9) follows from (4.10). 

In the remaining part of the proof, we show that (i) and (ii), together 
with the condition s = 0(1), imply (iii). For s > 2, we write, by Holder's 
inequality, 

/ n \ / n \ l/« 



(lijv 



k=2 \k=2 
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and observe that (4.12) implies 

{siy^/'x < (A(x))('^'^/^(Kl(x))'/^ 

For s = 1, we have >c = k,i[X). Now (iv) and (v), which fohow from (i) and 
(ii), imply that X = op(l) as m oo. □ 

Proof of Example 2. Let Di, D2 be independent random subsets 
which are uniformly distributed in the class of subsets of [m] = {1,2,..., m} 
of size x = {e + 0.5)m. Let s = 0.5m. Denote 

P*M=(^)'(7) v{m) = V{\D^r\D2\ = s), 

p"{m) = V{\Dir\D2\ >s). 

We show that for a sufficiently small absolute constant e G (0, 1), we have, 
as m — )• 00, 

(4.13) p*(m)=o(l), p"{m)=o{p*{m)). 

Observe that, by the first relation of (4.13), we can construct an increasing 
integer sequence {n^} such that nraP*{m) — )• 1 and nm{p*{m))'^ — )• as ?n — )• 
00. Hence, conditions (iv) and (v) of Theorem 2.2 are fulfilled with A = 1. 
In addition, by the second relation of (4.13), the average degree 'Eidm{vi) = 
{rim — l)p"{m) satisfies Edmivi) = o(l). Hence, dm{vi) = op(l), and this 
means that (2.5) fails. 

Let us prove (4.13). By Stirling's formula, as m — )• 00, we have (™) ~ 
2'"(0.57rm)-i/2 ^nd 

2 / \ 2{x-s) / \2s 



x\ X 0.5 + £ ,r 



sj \x — s J \^ J 2tts{x — s) vrem 

where = (1 + (2e)~^)^^(l + 2e) — )■ 1 as e — )■ 0. In particular, for a small 
£ G (0, 1), we have A,, < 2. The latter inequality implies p*{m) = o(l). 

Let us prove the second relation of (4.13). We denote p'^. = P{\Di n = 
r) and write 

(4.14) p"{m) = Y.p'r= p's (1 + Y.^p's-,k/p's)\ ■ 



It follows from the identities = ( ^ ) ( ) ( ^ ) " ^ and ^ 

that 



k=l 

p'g (s+fc)i.(m— 2x+s+fc)fe 



p'g ~ s^{m — 2x + s)^ ^' 

where = 4eV(l - 4e) as e — 7- 0. In particular, we have < 0.07 for 
£ < 0.1. Invoking these inequalities in (4.14) and observing that p'^ =p'{m), 
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we obtain, for e < 0.1, 

p{m) <p"{m) < l.lp'(m). 
We complete the proof of (4.13) by showing that p'{m) = o{p*{m)). We have 



x—s 



p'(m) (m — x)x-s fm — x\ , , , 

« < = (1 - 2e)^'" = o(l) asm^oo. 



p*{m) {m — s)x-s \m — sj D 

Before proving the statements related to clustering coefficient, we intro- 
duce some notation. For a real number o, we denote o+ = max{0,a}. Recall 
that V = {vi, . . . , Vn} is the vertex set of Gs{n, m, P), and Xi = \Di\ denotes 
the size of the attribute set Di oivi, 1 <i <n. By P(-) = P(-|I?i, X2, . . . , X^) 
we denote the conditional probability given Di and X2, ■ ■ ■ ,Xn. Write, for 
short, M = C^) and denote Yi = {-^;), Zi = YiU^/'^M-^/^ , and = BY^. 
We write Dij = Di n Dj and introduce the events £ij = {vi ^ Vj}, 

Ai = {Vi ~ V2,Vi ~ V3,V2^V3}, A2 = {\Di2\ = I-D13I = I-D23I = s} , 

A3 = {D12 = Di3 = D23} nA2, B = {Vi^ V2,Vi^ V3}, 

C = {\D23 \Di\>l, \Di2\ = \Di3\ = S}, Vij = {| Ajl > s + 1}. 

We have 

(4.15) a = as{m,P) = P{Ai)/P{B). 

Proof of Lemma 2. Here we consider the graph Gs{n,m,5x), where 
attribute sets of vertices are of size x, that is, Xi = x, 1 <i <n. We write 
p = P(|Ail=s). 

We note that (2.6) follows from (4.15) and the identities, which we prove 
below: 

(4.16) P{B) = P{vi ~ V2)Pivi ~ V3) =fil + o(l)), 

(4.17) P(^3) < P(^i) < P(^3) + o{f), 

(4.18) P{A3) = (':) V(l + o(l)). 



We obtain the first identity of (4.16) from the corresponding identity of 
conditional probabilities given Di that holds almost surely, 

P{B) = P{B\Di) = P{vi ~ V2\Di)P{vi ~ V3\Di) = P{vi ~ V2)P{vi ~ V3). 

The second identity of (4.16) follows from Lemma 6. In order to show (4.18), 
we write P(^3) =p*p, where = P(-D3 n {Di U D2) = Di2\\Di2\ = s), and 
evaluate 

_ {'m-2x + s)x-s P _(-. . 
^* (?) {m-x)x-s O ^^ + °^^^^(-)- 
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Let us prove (4.17). Since the event ^3 implies A2, and A2 implies ^1, we 
have 

(4.19) P{Ai) = PiAs) + P{A2 \ A3) + P{Ai \ A2). 

It remains to show that P(^2 \^3),P(^i \^2) = o{p). Noting that the 
event ^1 \ A2 implies at least one of events T>ij, 1 < i < j < 3, we write, by 
the union bound and symmetry, 

P(^i\^2)< Yl P(Ain^i) = 3P(Pi2n^i) 

i<«<i<3 

(4.20) 

<3P{Vi2r\{vi-^V3}). 
Now, conditioning on Di , we obtain from inequalities of Lemma 6 that 

P(Pl2 n {Vi ~ V3}\Di) = P{Vi2\Di) X P{vi ~ V3\Di) 

^ , X f m \ ^ f x\'^ I m 
<'. + lJ [s + l] 

(x — s) x\ 



{m- s){s + l) \ s J \s 
= o{f). 

We conclude that P(Pi2 n {vi ~ ^3}) = o{p). Hence, P(^i \ ^2) = o{p^). 
Next, observing that the event A2 \ A3 implies C, we write 

(4.21) P(^2\^3)<P(C) 
and evaluate P(C) =PiP2, where 

= P(|Z)23 \ ^ll > 1||^12| = \Dl3\ = S), P2 = P(I^12| = I^ISI = s). 

Here Pi is the probability that two independent subsets D2 \ Di and D3 \ Di 
of \ Di of size \D2 \ D\\ = \D3 \ Dil = x — s do intersect. By Lemma 6, 
Pi < (2; — s)'^{m — x)~^ . Hence, Pi = o(l). Finally, the simple inequality P2 < 
P{B) = (1 + o{l))p implies P(C) = o{f). □ 

Proof of Lemma 3. The proof goes along the lines of the proof of 
Lemma 2. 

In view of (4.15), relation (2.8) would follow if we show that for any 
e G (0, 1), we have, as m — )• oo, 

(4.22) (1 - 3e^ + o{l))a2alAr^ < P{B) < a2alM-^, 

(4.23) P(^3) < P(^i) < P(^3) + (e' + o{l))a2alM-^, 

(4.24) (1 - 2^2 + o(l))a?Af-2 < p(^3) < afikf-^. 
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We fix £ and introduce the indicator functions Ij = I{Xi<ev^} ^^"^ write 
li = l-li and I* =Ii][2l3. 

Let us show (4.22). We write P(^) = P(fi ~ f2)P(fi ~ fs) and apply 
(4.2) to each probability P(fi ~ Uj), i = 1,2. We have 

(4.25) YlY2Y^M~'^{l - r% < P{B% < P{B) < Y^YiY^M-'^ . 



Here 



r = l-\l- i^i-^) + {X2-s)+ \ / (Xi - s) + (X3 - s) 



m- Xi + l J \ m- Xi + l 

Observing that I* = 1 implies {Xi — s)+ < ey/m for each i, we write \r < 
2e^(l — m~^/^)~^. Now (4.25) implies the inequalities 

Y^Y2Y:iM~'^{l - 2e\l - m~'^l\^ - (1 - I*)) < V(B) 

(4.26) 

<YlY2Y-ihr'^ . 

Finally, taking the expected values and using the simple bounds 
Eyi2y2y3M-2(l - 1,) < Eyi2y2y3M-2(Ii +I2 +I3) 

(4.27) 

= o(l)a2a?M-2, 

we obtain (4.22). In the very last last step of (4.27) we used (2.7) in the 
form Ey^Ij = o(afc). 

In order to show (4.23), we combine (4.19), (4.20) and (4.21) with the 
following inequalities: 

P(Pi2 n {vx ~ v^}) < (e^ + o{l))a2alM-^, 

(4.28) 

P{C)<{e^ + o{l))a2ajM-^. 

Let us consider the first probability. Since the event P12 implies {vi ~ ^2}, 
we can write 

(4.29) P(Pi2 n {vi ~ V3}) < P{Vi2 n {vi ~ V3}% + P{B){1 - I*). 
Next, we apply (4.2) to each probability of the right-hand side, 

(4.30) PiV,2n{v,^v,})<^-£^^^, P{B)<^^, 

and write ^"^11^^'' = where r' = ^'^^i^l'^'^^^'^^^'^ ■ Finally, collecting 

inequalities (4.30) and r'l^, < in (4.29) and then taking the expected value 
in (4.29), we obtain the first bound of (4.28). In order to prove the second 
bound of (4.28), we write 

(4.31) P(C) =PiP2<PlP2^* +]52(1 - I,), 
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where = P(|Z)23 \ Di\ > l\\Di2\ = l^isl = s) and P2 = P(|I?i2| = l^^isl = 
s), and apply (4.2) to Pi and P2, 

^ {X2-s)+{X3-s)+ ^ Y^Y^Y^ 

We collect these inequalities in (4.31) and observe that p^^I* < e^/(l — m~"^/^). 
Now taking the expected value in (4.31), we obtain the second bound of (4.28). 

Let us show (4.24). We write ^3 in the form £1 n £2; where Ci = {D3 n 
{Di U D2) = Di2}., C2 = {1^12! = s], and decompose P(^3) = p^p^- Here 
^3 = P(£i|£2) and p^ = P(£2) satisfy 

(4.32) (1 - f,)Y,M^' <p,< Y,M-\ f, := + - ^'^(^^ - ^) ^ 

m - A3 

(4.33) il-f,)Y,Y2M-'<p,<Y,Y2M-\ (Xi - sU{X2- sU ^ 

m — Xi + 1 

Note that (4.33) is an immediate consequence of (4.2). In order to get (4.32), 
we write 

/ m—X\—X2+s > 

(4.34) P3 = - 



Xi-s ) _a^^ {m-Xi-X2+s)xi- 



and apply to 9 the chain of inequalities 

{a)r ^ {a-ry b V^^ br 



{a + b)r {a + b — rY \ a + b — rj a + b — r 

Combining (4.34), (4.32), (4.33) and P(^3) =^3^4, we write 

(4.35) (1 - f3)(l - u)YiY2YsM-\ < P{A3% < PiAs) < YiY2Y^M-\ 

Finally, invoking in (4.35) the inequalities fjl* < e^(l — m~^^'^)~'^ and then 
taking the expected values, we obtain (4.24). □ 

Proof of Theorem 2.3. In the case where (ii') holds, we apply Lemma 3. 
Note that conditions (i) and (ii') imply the uniform integrability of the se- 
quences of random variables {Z^i}m and {Zmi}m', that is, for A: = 1,2, we 
have 

(4.36) Ve > 0, 3A > such that Vm BZ^^I{z„,^^^y < e; 

see Lemma 7. Hence, (2.7) holds, and Theorem 2.3 follows from Lemma 3. 

Now assume that EZ^ = +oo. In order to prove that a = o(l), we show 
[see (4.15)] that 

(4.37) P{Ai) = 0{n-^) and liminf n^P(i3) = +oo. 
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Before the proof of (4.37), we introduce some notation. Denote 

h^ip{h)=supEX^^I{x^^>h}, eh = (^(/i^/^), 5^ = max{/i^\e,,}, 

m 

and observe, that (i), (ii) and (2.9) imply ip{h) = o(l) as /i — t- oo. Hence, 
6h = o(l) as /i — 7- +00. Furthermore, by Chebyshev's inequality, 

P{Xmi > m6m) < (m(5m)" V("i'5m) < {mdmye-m < m'^ell^ 

(4.38) 

= o(m"'*). 

For = 2,3 and t = 1,3, denote := \ -Di and introduce the events 
nt = {\Dt\<m5^}, 

Qi = {\Dir\D2r\D^\=i}, Q* = {\Dlr\Dl\>s-i}, 0<i<s-l, 

Qs = {\Dir\D2r\D^\>s}, Qi = {\Dir\Dl\>Q}, n = ninn3. 

Let us prove the first bound of (4.37). We denote R = P{Ai n V.) and write 

(4.39) P(^i) = P{Ai nn) + R = Po + ---+p, + R, 

where = P{B Ci Qi Ci Q* Ci Ti) . Note that the remainder R is negligibly 
small, 

R < Piv2 ~ V3)P(Wl) + Pivi ~ V2)Pin3) = o(m-2^). 

In the last step, we used (4.38) and the inequality P{v2 ~ v^) < afM'^, 
which follows from (4.2). Let us estimate = P(Q* \BnQin n)P{B n Qi n 
%) =: PiiPi2- Note that, on the event BCiTii, the sets D2 and are random 
subsets ofW\Di of sizes \Dl\ <Xk — s, where | \ Di| > m(l — 6m) =■ m' 
is of order (1 — o{l))m. By (4.2), the probability that their intersection has 

at least s — i elements is bounded from above by E( ^^.)( "^^Of •) • Hence, 
Pj^i = 0(m*~*). Next, we estimate 

Pi2 < P{{vi ~ ^^2} n Qi n ns) = p{\d*2* nDl*\>s-i\Qin ?^3)P(Q^ n 7^3) 

= -Pi2Pi2- 



Here the random subsets D'^* := \ of \ Ds are of sizes at most 
— i< Xk, k = 1,2. Since on the event Ti^ we have Dsl > m' , inequal- 
ity (4.2) implies p[2 < = 0{m'-'). Finally, we estimate 
p'l2- Let 5 count the number of subsets Ai of W of size i covered by the 
intersection n D2 H D3, that is, 5 = Z^^-ciy ^{AiCDinL>2nL>3}- We have 

-2 



p'l2<P{S>l)<BS={'^]{P{AeD^)f=r'] (E 
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Hence, = 0(m~^*). Collecting these bounds, we conclude that 

Pi = PiiPi2 < PiiPi2Pi2 = 0{m-^'). 

Now the first bound of (4.37) follows from (4.39). 

Let us prove the second identity of (4.37). Given t < m, denote In = 
I{Xi<t} ^-iid write [see (4.2)] 

P(i3) = EP(i3) > EIuhthtP{B) 

(4.40) 

> M-^BIuhthtY,^Y2Ys{l - t^/{m - t)f. 

We have, by (i) and (ii), that Elj^li — )• ai as m,t ^ +00. Furthermore, 
(i) and EZ^ = 00 implies Ey^^Ii( — )• +00 as m,t ^ 00. Therefore, letting 
t = tm^ +00 and tm = o(m^/^), we obtain (4.37). □ 

Proof of Remark 2. Here we assume that /3m := i"^)n~^ tend to 
+00 as m,?i— T-oo. In the case where (i), (ii) and (ii') hold, the relation 
as{m, P) = 0(1) is an immediate consequence of Lemma 3. In the case where 
(i) and (ii) hold and EZ^ = 00, we show (4.37) by the same argument as that 
used in the proof of Theorem 2.3 above. In particular, in the proof of the 
first bound of (4.37), we choose 6m = /3rn'^^^^\ and, for Tm = ("'f'" )/3m^^^, 
we write [cf. (4.38)] 

F{Xml > m6m) = P{Zml > Tm) < T~^'EZmll{Zmi>Tm} = o{Tm)- 

Here < c{s)m~^/'^n~^/'^5^ = o{n~^). In the proof of the second identity 
of (4.37), we choose t = tm = m^l'^rT^I'^'^^'^ in (4.40). Finally, from (4.37) 
and (4.15) we deduce the relation as{m,P) = o(l). □ 

Proof of Theorem 2.4. We start with some notation. By /^(A) = 
e~^X^/i[ we denote Poisson's probability. Observing that the absolute value 
of the derivative of the function A — )• /i(A) is bounded by 1, we write |/i(Ai) — 
/i(A2)| < |Ai — A2I, by the mean value theorem. We denote 

Hi = Ar^YiY2Y3 = n-2/3;,V2^iZ2Z3, H2 = n'^Z^ZsZa, 

Hi = EZi, and ^m = M/n, and introduce the event /C = {'^^=4^{vir^Vj} = 
k — 2}. We also note that, for i = 1,2, the moments EZ| and = 'EYf = 

—i/2 

/3m EZ\ are bounded from above and that they are bounded away from 
zero as m, n — )• 00. 

Let us prove (2.14). Given integer k>2, we write 
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where p* = P(^i n K,) and p** = Y*{B n /C). We shall show that, for any 
eG(0,l), 

(4.42) p* = 71-2/3-1/2^,, (fe - l)E/fc_i(^iiZi) + 

(4.43) p** = n-\k - l)kBfk{fiiZi) + i?2, 
where the remainder terms are negligibly small, 

\Ri\ < n^^c{l3){a2al + al + af){e + o{l)), 

\R2\ < n-2c(/3)(a2a? + af )(e + o(l)). 

Here the number c(/3) depends only on f3. Note that (i) and (ii) imply /xi — )• /i 
and the convergence in distribution fJ^iZi — t- fiZ. Hence, we have the conver- 
gence of the expectations E/j(/iiZi) — ^ E/j(/iZ) =pj. Now (4.42), (4.43) and 
(2.9) imply p* = n~'^ fi~'^/'^ ^{k - l)pk~i + o{n~'^) and p** = n~'^{k - l)kpk + 
0(77.-^). Substitution of these identities into (4.41) shows (2.14). 

In the remaining part of the proof, we show (4.42) and (4.43) for a given 
e G (0, 1). For this purpose, we write 

(4.44) p* = EP(^i)P(/C), p** = EP(e)P(/C) 

and show that the conditional probabilities P(^i), P(;B) and P(/C) can be 
approximated by 

(4.45) P(/C)«A_2(/iiZi), P{Ai)^Hi, P{B)^H2. 

Note that substitution of (4.45) into (4.44) gives the leading terms of (4.42) 
and (4.43). In the remaining part of the proof, we show the validity of such 
an approximation. 

Note that since conditions of Theorem 2.4 are more restrictive than those 
of Lemma 3 (see the proof of Theorem 2.3), we can rightfully use the in- 
equalities established in the proof of Lemma 3. 

Approximation of P(/C). We first establish an upper bound for A = 

P(/C)-/fc_2(/Ul^l), 

(4.46) |A| < 2ki + K2 + l/i- /Uil^"!. 
Here 

n n 

jl = n~^''^^ Zi, Ki = Zfzf 



72 

i=4 i=4 



K2 = n 2_^ZiZi 



m-Xi + l 

i=A 



We show (4.46) in two steps. In the first step, we apply Le Cam's inequality 
(4.1) to the sum Yll=A^{fi'^Vi} of conditionally independent random indica- 
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tors, 
(4.47) 



In the second step, using the mean value theorem, we replace the argument 
of fk~2 by jJ-iZi in (4.47). The approximation error estimate (4.46) now 
follows from the inequalities 

n n 

(4.48) Y.(P{£u)f<Ku ~^Zi-k2<Y,P{8u)<f^Zi, 

which are simple consequences of the inequalities of Lemma 6. 

Next, using (4.46), we replace -P(/C) by /a;_2(/Ui^i) in (4.44) and show that 
the error of the replacement is negligibly small. We denote \a = I{Zi<yl} ^^^^ 

= 1 — Iy4 and observe, that in view of (4.36), we can find A>\ such that, 
uniformly in m, 

(4.49) EZflyi < eEZl, k = l,2. 
We write 



(4.50) 
(4.51) 
where 



P(^i)P(/C) = P{Ai)fk^2{f^iZi)lA + n + r2, 
P(e)P(/C) = P{B)fk^2{f^iZi)lA + rs + r4. 



n = P{Ai)P{JC)lA, 
rg = P{B)P{IC)Ia, 



r2 = P(A)AlA, 
n = P{B)AIa, 



and show that 
(4.52) 



Elr 



a2al 



for i 



1,3, 



-o{M-^n~^)al for j = 2,4. 

To prove (4.52), we write, using the inequalities P(^i) < P{B) and P{13) < 
H2 [see (4.2)], 

(4.53) n < H2IA for i = l,3 and |rj| < i?2| A|Ia for j = 2, 4 

and notice that, for i = 1,3, inequalities (4.52) follow from (4.49) and, for 
j = 2,4, inequalities (4.52) follow from the bound EZ^|A|Ia = o(l). Let us 
prove this bound. We note that (4.46), combined with the simple inequality 
|A| < 1, implies 

|A| < 1 A {2ki) + 1 A (Zi|/i - /ill) + 1 Ak2. 
Here we use the notation a Ab = min{a, b}. Hence, for j4 > 1, we can write 

(4.54) Zf\A\lA < ca{1 a Ki + 1 a l/i - /ill + 1 a K2), 
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where the number ca > does not depend on m, and where 

n n 

k\ := Z?, ^2 := n^^m~^ Zi{Xi — s). 

j=4 i=4 

The bound EZ^|A|Iyi = o(l) follows from (4.54) and from the bounds 

(4.55) k\ = op{1), fl- fii = op{l), K*2=op{l). 

The first and second bounds of (4.55) are shown in Lemma 7. To obtain the 
third bound, we write Ek^ < m'^'EZ^X^ = o(l); see (i), (ii'). 

Approximation of P(^i). Now we replace P(^i) by Hi in the right- 
hand side of (4.50) and show that the error of the replacement is negligibly 
small. We proceed in two steps. First, we approximate P(^i) ~ P(^3), 
and, second, we approximate P(^3) ~ Hi. In the first step, we combine the 
inequalities [see (4.20), (4.21) and (4.28)] 

P(^3) < P(^l) = P(^3) + P(A \ A2) + P(^2 \ ^3), 

EP(^i \ A2) < 3P(Pi2 n {vi ~ V3}) < 3{e^ + o{l))M-^a2al 
EP(^2 \ ^3) < P(C) < (e^ + o{l))M-^a2al 
and show that 

P(^3)/fc-2(W^l)lA < P(^l)A-2(/Ul^l)IlA 

(4.56) 

<P(^3)/fc-2(w^l)lA + r5, 

where Ers < 4(e^ + o(l))M~^a2a^. In the second step, we apply (4.35) and 
write 

(4.57) Hifk^2{f^iZi)lA - re < P(^3)/fe-2(/"i^i)lA < Fi/fc„2(/"i^i)lA, 

where Erg < (2e^ + o(l))M~^ap Finally, using the inequality 1 — lyi = I^i < 
1, we write 

(4.58) EHifk-2{l^iZi) - r7 = EHifk-2{^iiZi)lA < EHifk_2{f^iZi), 

where ry = Eifi/fc_2(/iiZi)lA < Ei^il^ < eM~'^a\. In the last step, we 
used (4.49). Finally, from (4.44), (4.50), (4.56), (4.57) and (4.58) we ob- 
tain (4.42). 

Approximation of P(S). Now we replace P(S) by H2 in the right-hand 
side of (4.51). For this purpose, we combine (4.25) and (4.26) with the in- 
equality fk~2ifJ'Zi) < 1 and obtain 

(4.59) H2fk-2{l^iZi)lA - rg < P(i3) A,_2(/Ui^i)Ia < i?2/fc-2(m^i)IlA, 

where rg < (3e + (1 — I^))H2 is negligibly small, because Erg < (3e + o(l)) x 
M~^a2af . Next, we proceed as in (4.58) above and write 

(4.60) EH2fk-2{f^iZi) - rg = EH2fk-2if^iZi)lA < Ei?2/fc-2(w^i), 

where rg = Ei?2/fc-2(/^i^i)lA < EH2IA < eM~'^a2al. Finally, from (4.44), 
(4.51), (4.59) and (4.60) we obtain (4.43). □ 
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4.3. Passive graph. 

Proof of Theorem 3.1. The proof consists of two parts. In the first 
part, we estabUsh the convergence of the Fourier transforms 

(4.61) hm Ee**^ = Ee**'^-. 

In the second part, we show that, as m — )• oo, 

(4.62) P(d/L) = o(l). 

Part 1. Here we prove (4.61). Before the proof, we introduce some nota- 
tion. We denote Ij{w) =I|u,g£)^,} and write 

(4.63) L = {Xi- 1)+Ii(u;i) + • • • + (X„ - 1)+I„(u;i). 

Here we use the notation a+ = max{0, a}. Given X = {Xi, . . . , X„), we gen- 
erate independent Poisson random variables ryi(Xi), . . . ,r/„,(X„) with mean 
values E{rij{Xj)\Xj) = m~^Xj. We also generate independent random vari- 
ables Yi (X) , I2 (X) ; • • • with the common probability distribution defined as 
follows. In the case where S{X) = Xi -|- • • • -|- X„ is positive, we put 

n 

P(yi(X) = j\X) = {j + 1)-^, = Ylhx,=j+i}, J = 0, 1, ... . 

For S{X) = 0, we put yi(X) = 0. By . . . (resp., Yi,Y2, . . .) we denote 
the corresponding unconditional random variables, that is, r]j is the outcome 
of a two-step procedure: first, we generate X and then generate r]j{Xj). Let 
r/(X) = r]i{Xi) -!-••• + rjn{Xn) and rj = rji + ■ ■ ■ + rjn. Introduce the random 
variables 

■n 

(4.64) r = (Xi - l)+77i + • • • + (X„ - l)+r/„, C = 

i=i 

and observe that, given X, their conditional distributions Pt{X) and -P^(X) 
are compound Poisson and that Pr{X) = Pg(X). Here and below in this proof, 
by -P((X) (and ExC) we denote the conditional distribution (and conditional 
expectation) of a random variable given X . 

Let us prove (4.61). For this purpose, we write Ee**'^ — Ee**"^* = Ii + I2, 
where h = Ee**^ - Ee**« and h = Ee**? - Ee**"'* , and show that /i, /2 = o(l). 
In order to show the first bound, we write 

h = E(Exe**^ - Exe**«) = EA, A = (Exe**^ - Exe^*^) 

and invoke the inequalities 

n 

(4.65) 2-i|Exe^*^-Exe^*"| <dTv(i^L(X),/V(X))<2m-2^X|. 
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Here c^tv denotes the total variation distance. Indeed, we obtain from (4.65) 
that, for any e > 0, 

/ n 

2 



<e + 2P(|A| > e) < e + 2F [ 4m-^"^Xf > e ) =e + o{l). 



In the last step, we applied (4.5). Hence, Ii = o(l). Let us prove (4.65). 
The first inequality of (4.65) follows immediately from the definition of the 
total variation distance. The second one is a simple consequence of LeCam's 
inequality (4.1). Indeed, we have, by the triangle inequality, 

n-1 
j=0 

where tq = L, Tn= t, and, for 1 < j < n — 1, 

j n 

T, = Y^iXt - + (Xi - i)^Uw^). 
t=i t=j+i 

Furthermore, noting that the sums Tj and Tj+i differ only by one term, we 
obtain 

dTv(^'r,(X),P,^.+,(X)) < dTv(fll,+,(X),P,,^,(X)). 

Finally, invoking the inequality (iTv(-Pnj+i (X), -Pr)j_|_i (X)) <2m~'^X^_^i [see 
(4.1)], we arrive at (4.65). 

The proof of the second bound I2 = o(l) is based on the fact that r/ — )■ A 
and Yi — t- Zi in probability as m — t- 00. Details of the proof are given in the 
separate Lemma 8. 

Part 2. We write, by inclusion-exclusion, L > d> L — T, where the num- 
ber 

^<i<j<nweW\{wi} 

is at least as large as the number of vertices w having two or more links to 
wi. Hence, we have P(d ^ L) < P(r > 1). In order to prove (4.62), we show 
that P(T > 1) < e + o(l) for every e E (0, 1). For this purpose, we write 

P(r > 1) = EP(r > 1|X) < e + P(P(r > 1|X) >e)<e + P(ExT > e) 

and invoke the bound ExT = op(l). Let us prove this bound. We write 



l<i<j<n 

and use the bound n~^^'^{Xf + • • • + X^) = op(l), which follows from (4.5) 
(applied to random variables xf^^ , 1 <i <n, and a = 3/2). □ 



DEGREE AND CLUSTERING 



31 



Lemma 8. Assume that the conditions of Theorem 3.1 hold. For ^ de- 
fined in (4-64) and every real t, we have liuim^+oo Ee**^ = Ee**''* . 

Proof. We write h = Ee^*« - Ee**'^* . 

The cases EZ = and EZ > are considered separately. In the case where 
EZ = 0, the random variable d^, is degenerate, P((i* = 0) = 1 and the lemma 
follows from the relation 

P(^ = 0) > P(7? = 0) = EP(?? = 0|X) = Ee-^(^)/'" = 1 - o(l). 

In the last step, we combined the bound n~^S{X) — 'EZ = op{l) [see (4.4)] 
with EZ = 0. 

Next, we consider the case where EZ > 0. We need some more notation. 
Denote 

/(t) = Ee^*-^* , /x(t) = Exe^*^i , /. (t) = Ee^*^i , 
Ax = Ex??, A* = EA, 

A = Exe^*« - /(t), 5 = (hit) - l)Ax - (Mt) - 1)A, 

and write the Fourier transforms of the compound Poisson distributions 
Pd, and P^{X) in the form Ee**''* = e(^*(*)-i)-^* and Exe**? = e^Mt)-^)^x^ 
respectively. Introduce the events 

A{e) = {\n~^S{X) - EZ\ < 2^^emin{l,/3,EZ}}, e > 0, 

and the function t — )• a{t) = sup^{(EXi)^^EXiI|jS(^j>j}}. It follows from 
Lemma 7 that conditions (vii) and (viii) imply that n~^S{X) — EZ = op{l) 
as m — )■ oo and a{t) = o(l) as i — )• +oo. Hence, we have 

(4.66) Ve>0 P(^(e)) = 1 - o(l) asm^oo. 

In addition, for every e > 0, we can choose a positive integer t^ such that 

(4.67) a{te) < e and {EZy^EZI{z>t,} < e- 

In order to prove I2 = o(l), we show that there exists a number c > 0, 
depending only on EZ and /3, such that, for any e S (0, 1), 

(4.68) limsup I/2I < ce. 

n 

In the proof of (4.68), we assume that m, n are so large that /3 < 2m/n < 4/3 
and EXi < 2EZ < AEXi. In particular, on the event A{e), we have 

(4.69) m-'^S{X) <3(3-'^EZ and S{X) > 2~^nEZ > 4-'^nEXi. 
We fix e and write 

(4.70) /2 = EA = /2i+l22, /2i = EAI^(,), /22 = EA(1-I^(,)). 
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Note that | A| < 2 (as the absolute value of the Fourier transform of a prob- 
ability distribution is at most 1). This inequality, together with (4.66), im- 
plies I22 = 0(1) as m — )• 00. Next, we estimate l2i- Combining the identity 
A = /(t)(e'^-l) with the inequalities \f{t)\ < 1 and le-'-ll < Efc>i I'^l''/^' ^ 
l^lel*^! , we obtain 

|/2i|<E|5|el^ll^(,)<e8^'E|5|I^(,). 

In the last step, we used the bound \5\ < 8A*, which follows from the in- 
equality \5\ < 2Ax + and the inequality Ax < 3A*; see (4.69). 
Next, we show that E|(5|I[_4(£) < (2 + 7X^)e + o(l). We write 

6 = (/x(t) - l)(Ax - A.) + (/x(t) - Mt))X., 

estimate \5\ < 2|Ax — A*| + X*\fx(t) — f*{t)\ and substitute 

|Ax - A,| < \n~^S{X) - BZ\nm"^ + |nm"^ - ^~'^\BZ. 

In this way, we obtain E|(5|I^(£') < 2/31 + 2/32 + A* 133, where 

/31 = nm-^B\n-^S{X) - EZ|I^(^) < e, 

(4.71) 132 = |nm-^ -/3-^|EZ = o(l), 

l33 = E|/x(t)-/*(t)|I^(,). 

It remains to estimate 733. We expand fx{t) — f*{t) = Ri + R2 + R3, where 
Ri= e'^^5p{k), 5p{k) = V{Yi{X) = k\X)-V{Zi = k), 

0<k<te 

(4.72) \R2\ < P(Zi > + 1) = {EZ)-^EZI{z>t^^2} < e, 

n 

\R3\ < P{Yi{X) >t, + 1|X) = {S{X))Y,Xj\x,>t^+2} 
and observe that the last inequality of (4.69) implies 

(4.73) E\R3\Ia{s) < 4{EXi)-^E(^n-^J2XjI{x,>t,+2}^ < M^e + 2) < 4e. 

Finally, we write < J2o<k<te \^pi^)\ expand 6p{k) = Yl'j=i + 1)) 
where 

k / n 1 \ 

ri{k) = -g^i^k - np^{k)), r2{k) = kp^{k) ( - ^ J > 

and r3{k) = ^{px{k) — Pz{k))- Here we denote Px{k) = P{Xi = k) and 
Pz{k) =P{Z = k). Since the number te is fixed, it follows from (vii) that 

(4.74) J2 k3(fc + l)| 

= 0(1) as m — 7- 00. 

0<k<ts 
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Furthermore, on the event A{£), we obtain from (4.69) that 



|ri(fc)| <4{EXiy^k 



Pk{x) 

n 



\r2{k)\<2e{EXiy^kpk{x) 



Hence, we have X]o<fe<t£ k2(fc + l)|I^(e) < 2e and 

E ^ h(fc + l)|I^(,)<^ kUl"^ - p,{x))\'' 

- EXi ^ 

= o(l) 

as m — 7- oo. We conclude that E|i?i|I_4(£) < 2e + o(l). This bound, together 
with (4.72) and (4.73), shows the bound I33 < 7e + o(l). The proof is com- 
plete. □ 

Proof of Lemma 4. Let us prove (3.2). We only consider the case 
where em(max{l, n*})-^ > 2, since otherwise (3.2) is obvious. Given < 
k < 2~^em, introduce the random variables 

n n 

<em/k} ) >em/fc} 
1=1 1=1 

and note that the equality of events {d>l} = {dAik) > 1} U {dsik) > 1} 
implies 

(4.75) P{dB{k) > 1) > P{d > 1) - P{dA{k) > 1). 

Combining (4.75) with the inequalities P{d > emk~^ — 1) > P((i_B(A;) > 1) 
and 

n 

P{dA{k) > 1) < E{E{dA{k)\Xi,. . .,Xn)) = E J^X,m-il|2<x,<.m/fc} < en,k-\ 

i=l 

we obtain P((i > £mk~^ — 1) > P{d > 1) — en^k^^. Finally, we choose k = 
max{l,n*} and obtain (3.2). 

The proof of (3.1) is much the same, but now we choose k = n. 

Let us prove (3.3). Given N = Y17=i ^{Xi>2}j let Y/v be a random variable 
with binomial distribution Bin(A^, 2m-i). We have 

P{d >C)> P{Yn >C)> P(Yn > C\N > 2-^n^)P{N > 2-^i*) = 1 - o(l). 

In the last step, we used the simple facts about the binomial distribution 
that EiV = n* oo implies P{N > 2-^n^) = 1 - o(l) and that B{Yn\N = 
2^^n^) = n^,m~^ — )■ +00 implies 

P{Yn > C\N > 2~^n^) > P{Yn > C\N = 2~^n^) = 1 - o(l). □ 
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Proof of Theorem 3.2. The convergence of the degree distribution 
follows from Theorem 3.1. Assuming, in addition, that limm_!.oo EXj* = 
EZ^ < oo, we obtain the convergence of moments E(Xi)fc — t- E(Z)fc, k = 2,3, 
as m,n — ?■ oo. Hence, we can write (3.4) in the form 

*/ r>\ E(Z)3 

"("'"^'^) = rW)^FTE(z)^ + °^')- 

Invoking the identities E(Z)2 = (3Ed* and E(Z)3 = (3{E{d^)2 - (Ed*)^), we 
obtain (3.5). 

In the case where EZ^ < oo and EZ^ = oo, we have E{Xi)2 E(Z)2 < oo 
and E(Xi)3 — ;> oo. Therefore, (3.4) implies a* — ;> 1. □ 

Proof of Theorem 3.3. In the proof, we use some new notation. Let 
{^2,^3} be a random pair of vertices uniformly distributed in the set of 
all pairs from W \ {wi}. Let 6 = {61,62, ■■■ iSn) denote an ordered collec- 
tion of subsets of W. We call 6 a set-valued vector. Introduce the random 
set-valued vector D = {D^, D2, ■ . ■ , D^), where D* = Di for wi G Di and 
D* = otherwise. Denote X = {XI,. . .,X*), where X* = \D*\. Introduce 

the function 6 — )• h{6) = 2X]r=i('^'2~^)^{5i7^'2'} ^^"^ random variable H = 
h{D) = Yll=ii-^i ~ l)2l{x*>i}- A collection X of set-valued vectors is identi- 
fied with the event D€l.We denote Si = Eh{D)Ix = E^ex ^(^)PP = 
By Pj we denote the conditional probability given the event {D = 6}. De- 
note nj = ^"^^I|5^=0}. 

Now we fix an integer k >2 such that P{d^ = k) > and assume that 
m,n are so large that P{d = k) > 2^^P{d^ =k) >0. Introduce the events 

A = ~ if^i ~ w^3, 1^2 ~ tf3}, A = {d = k}, Ak = {d = L,L = k} 
and the probabilities 

= P(wi ~ W2,wi ~ wl,d = k), 

= P(wi '-^ W2,wi'^ w%^,d = k,d^ L). 

Here {w2^w\} is a random subset of \ {wi} uniformly distributed in the 
class of subsets of size 2. We assume that {w2,w\} is independent of the 
random sets Di, . . . ,Dn defining the intersection graph. 
In order to prove (3.6), we write a*!*^! in the form 

_ P(An^) 



a 



and invoke the identities 

(4.76) P(A n ^) = \ Ed2,\d,=k} + o(m-2), 



k\ ( m — 1 



(4.77) ^'^=( 2 jl ""2 ' ) P{d.=k)+o{m-'). 



-1 
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In order to show (4.77), we note that every pair {w',w"} C W \ {wi} has 
the same probabihty to be covered by the neighborhood of wi. Hence, by 
symmetry we have 

^1 



(4.78) 



Now (4.77) follows from the first identity and the convergence P(d = A;) — )■ 
P(4 =fe). 

Let us show (4.76). Observe that P{AnAn{d^ L}) <p^, and by (4.78) 
and (4.62) we have = o{m~'^). Hence, 

(4.79) P(A n^) = P(An A) + o(m"2). 
Next, we expand, by the total probability formula, 

(4.80) P(A n A) = ^ P5(A)P(:d = 6) 

SeAk 

and note that Pj(%) < Pji^) < Psi^j) + ^sC^s)^ where 

Cj = {^Dj -.w^^w^ G Dj and w\ ^ -Dj}, Dj = {^Dj -.wx^w^i^w^ E 
Observe that 

m 



(4.81) Pt(Pt) 



(m — 1) 



2 



and, since nj<n, 

(4.82) P.(C.) = 1 - f 1 - ^(^y < = 0{m-'). 

^ ^ V (^-1)2/ ("1-1)2 

Here X' denotes the random variable | Di \ conditioned on the event wi^ Di. 
Collecting (4.81) and (4.82) in (4.80), we obtain 

(4.83) P(An.4fc)= , \, cS^,+0(m"3). 

(m- 1)2 

Now we replace the event Ak by A in (4.83). For the left-hand side, we 
apply (4.79). For the right-hand side, we apply the inequalities 

(4.84) Sa>Sa,>Sa- (':]p{d = k,dy^L)=SA-o{l) 



[here we used h{6) < (3) and (4.62)]. We obtain 



-2' 

1 



P(A n ^) = , \, Sa + o(m-2). 
(m- 1)2 
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Finally, (4.76) follows by the convergence of 5^ = EHI^^^f^y to E(i2*I{d,=fc} 
as m,n— )• oo. We derive this convergence from the weak convergence of 
bivariate random vectors {H,L) — ?■ (ii2*,d*), which is obtained using the 
same argument as that of the proof of Theorem 3.1. □ 
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